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ABSTRACT 
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Tbe  purpose  of  this  project  Is  to  categorize  cardiac  patients  using  an 
e:Q)ected  utility  criteria.  Bayesian  \ipdating  using  symptomatic  informa- 
tion has  been  a successful  tool  for  disease  diagnosis.  However,  in 
classification  of  cardiac  patients,  discriminant  analysis  using  the 
BMDO^M  computer  package  achieves  comparable  results  with  far  less  effort. 
JDlscrlinlnant  scores  can  be  evaluated  and  normalized  as  probabilistic 
estimates  of  the  patient's  cardiac  condition.  By  incorporating  these 
probabilities  with  subjective  value  estimates  for  possible  mis- 
classlflcations,  the  expected  utility  of  assigning  patients  to  each 
cardiac  category  can  be  C8j.culated.  Patients  are  then  classified  into 
cardiac  categories  baaed  iqpon  highest  expected  utility. 
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1.  nrrROKxrriow 


Typically,  patients  with  chest  pains  appear  at  coronary  units  and  are 
faced  with  a barrage  of  medical  histories,  testizjg  procedures,  and  lengthy 
hospital  stays  before  their  specific  problem  can  be  isolated.  Often, 
patients  suffer  further  deterioration  in  health  before  treatment  actually 
begins.  There  are  varied  symptoms  to  examine,  many  of  which  provide  mini- 
mal information.  There  is  a wide  selection  of  tests  and  experiments  that 
can  be  performed,  often  at  a high  cost  to  the  patient  in  terms  of  both  money 
and  time.  Many  of  the  tests  provide  little  additional  diagnostic  informa- 
tion. 

This  paper  provides  physicians  at  coronary  care  units  with  a non- 
Invaslve  model  for  classification  of  cardiac  patients.  By  applying  the 
teehnl<iues  of  discriminant  analysis  and  expected  utility  decision  making, 
patients  can  be  diagnosed  upon  admission  to  the  coronaiy  care  unit  (CCU), 
using  no  more  than  medical  history,  prodromal  data,  and  routine  testing 
procedures.  One  approach  to  this  problem  involves  calculating  the  proba- 
bilities that  a patient  has  a specific  cardiac  condition,  evaluating  the 
conse<iuences  of  treating  a misclassified  patient,  and  making  a diagnosis 
that  would  maximise  the  "benefits"  to  the  patient.  The  basic  approach 
thus  identifies  patient  categories  for  treatment  more  rapidly  and  at  less 
expense  to  the  patient.  This  in  turn  leads  to  higher  quality  medical  care. 

In  general  mathematical  terms,  we  can  describe  the  decision  problem 
for  a specific  patient  as 

MAX  Ee(S)  (1) 

c 

where 

c c [cj^,  02,.*.,  CjJ 
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and  Eg(S)  is  the  e^qpected  utility  of  placing  a patient  with  syiaptoma  S into 
a category  c.  This  can  be  schematically  represented  as 


ACTIOWS  EmgTED 
UTILITIES 


is  the  utility  of  classifying  and  treating  a true  category  J 
as  a category  i* 

This  general  approach  will  be  described  in  detail  in  the  paper. 


II.  BACKSHOUKD 


In  197U,  the  Cardiology  Division  at  Stanford  University  Medical 
Center  published  a procedure  for  analyzing  the  clinical  course  of  chest 
pain  admissions.  The  pvtrpose  iras  to  identify  coronary  disease  patients 
prior  to  the  actual  incidence  of  infarction.  Examinaticm  of  medical 
histories  and  prodromal  data  could  lead  to  detection  of  potential  infarc- 
tions, and  action  could  be  taken  to  avoid  the  sudden  death  that  frequently 
accompanies  infarction.^ 
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John  S.  Schroeder,  M.D.,  Alfred  P.  Spivak,  M.D.,  and  Irene  Lamb,  R.H., 
all  of  the  Cardiology  Diviaion,  included  the  folloiring  goals  in  their 
analysis: 

A.  To  determine  the  incidence  and  frequency  of  patients  admitted 
to  the  coronary  care  unit  or  coronary  siurveillance  unit  mho  have  had: 

1.  Myocardial  infarction  (M.I.)  ruled  out,  chest  pain  not  of 
cardiac  origin. 

2.  Myocardial  infarction  ruled  out,  unstable  angina  present. 

3.  Subendocardial  infarction. 

4.  Definite  myocardial  infarction. 

5.  Pre- infarction  angina  with  progression  to  myocardial 
Infarction  during  hospitalization. 

B.  To  characterize  and  compare  prodromal  symptoms  of  these  patient 
groins. 

C.  TO  Identify  environmental  or  psycho-social  stresses  which  may 
have  contributed  to  the  onset  of  chest  pain. 

D.  TO  identify  clinical  and  laboratoiy  characteristics  of  the 
pre-infarction  patient  who  will  progress  to  myocardial  infarction  or  death 
during  hospital  stay. 

E.  To  identify  patients  with  high  risk  of  subsequent  myocardial 
infarction  or  death  follcsring  hospital  discharge. 

' f 

This  project  is  primarily  concerned  with  building  a model  to  accom- 
plish their  first  goal.  The  coronary  categories  that  they  established  will  j 
be  used  here.  1 
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III.  METHODOLOGY 

In  establishing  a model,  our  efforts  followed  the  steps  given  below: 

1.  Data  collection. 

2.  Assessment  of  a utility  structture. 

3.  Selection  of  the  key  variables. 

U.  Construction  of  classification  model  based  on  key  variables. 

5.  Validation  of  the  model. 

DATA  COLLECTION 

Dr.  Schroeder  provided  the  data  on  125  coronary  care  unit  patients. 

The  data  Included  historical  information,  prodromal  symptoms,  admission 
symptoms,  laboratory  results,  etc.,  as  well  as  a retrospective  determination 
of  the  disease  category.  We  considered  this  final  determination  to  be  the 
"true"  patient  state.  After  eliminating  two  patients  whose  data  appeared 
to  contain  serious  outliers,  the  following  breakdown  of  patients  remained: 

CATEGORY  NUMBER  OF  PATIENTS 


1. 

M.I.  ruled  out,  non-cardiac  pain 

15 

2. 

M.I.  ruled  out,  \mstable  angina 

55 

3. 

Subendocardial  infarction 

10 

4. 

Definite  myocardial  infarction 

29 

5. 

Pre- infarction,  progress  to  M.I. 

14 

To  insure  that  additional  data  would  be  available  to  later  test  the 
diagnostic  model,  twenty  three  patients  records  were  randomly  removed  from 
the  data  base.  This  left  100  patients  available  for  analysis. 

ASSESSMENT  OF  A UTILITY  STRUCTURE 

Methods  of  clMsifying  patients  into  disease  categories  using  proba- 
bility assessments  are  well  documented.  However,  high  probabilities  of  a 
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disease  category  are  often  insufficient  to  establish  an  optimal  diagnosis. 
Clearly,  it  is  far  more  serious  to  treat  an  infarction  patient  as  a non- 
cardiac pain  victim,  than  it  is  to  treat  a non-cardiac  pain  patient  as  an 
infarction  victim.  We  therefore  looked  at  the  value  or  utility  of  classify- 
ing patients  into  the  various  categories.  Since  Dr.  Schroeder  was  the 
decision  maker  for  this  pro,ject,  we  used  his  utility  structure. 

This  study  used  both  a direct  approach  and  a lottery  approach  to 
evaluate  the  relative  utility  of  the  possible  classification  schemes.  In 
describing  the  classifications,  we  used  the  coronary  categories  specified 
earlier.  In  the  direct  approach.  Dr.  Schroeder  subjectively  rank  ordered 
the  types  of  claissifications  from  best  to  worst.  The  best  outcome  would  be 
to  correctly  identify  a category  1 patient,  \rtiile  the  worst  would  be  to 
place  a category  U patient  into  category  1.  By  arbitrarily  placing  a value 
of  +100  on  the  best  case,  and  a value  of  -100  on  the  worst  case.  Dr.  Schroeder 
evaluated  all  of  the  possibilities  on  a relative  basis. 

o 

In  the  lottery  approach,  we  used  eaqjected  utility  theory.  This  im- 
plies that  if  an  individual'  is  satisfied  with  either  receiving  a guaranteed 
award,  or  taking  part  in  a lottery,  then  the  utility  of  the  sure  award 
equals  the  expected  utility  of  the  lottery.  Dr.  Schroeder  faced  a series 
of  lotteries  similar  to  those  shown  in  Figure  2.  He  was  presented  with 
a guaranteed  classification,  such  as  placing  a true  category  1 into  category 
5,  and  two  probabilistic  outcomes,  such  as  placing  a true  category  1 into 
category  1 and  a true  category  4 into  category  1.  He  had  to  assess  the 
probabilities  that  would  make  him  indifferent  to  the  alternatives.  In 
other  lotteries,  he  was  given  the  possible  outcomes  and  the  associated 
probabilities,  and  he  had  to  select  a sure  outcome  that  would  achieve 


and  probabilities 


Put  Cat.  1 -*  Cat.  1 


Put  Cat,  1 -*  Cat. 


Put  Cat.  4 -*  Cat,  1 

.2 


CALCULATIONS 

U (Cat.  1 - Cat.  5)  » .8  U (Cat.  1 -*  Cat.  1)  + .2  U (Cat.  U - Cat.  1) 

= .8  (100)  + .2  (-100) 

= 60 


IX)TTEKY 


anteed  outcome 


ssible  outcomes 
and  probabilities: 


Put  Cat.  1 - Cat.  1 
/ ^ 

Put  Cat.  1 -•  Cat.  2 

\ Put  Cat.  1 -•  Cat.  5 

a 

CALCULATIONS 

U (Cat.  1 - Cat.  2)  = 

.9  U (Cat.  1 -*  Cat.  1)  + .1  U (Cat.  1 

= 

.9  (100)  4 .1  (60) 

S 

96 

Figure  2 
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Indifference.  We  cranpared  t)  e results  of  the  lottery  eval\iation  method  with 
the  direct  method,  and  we  analyzed  areas  of  discrepancy.  By  varying  out- 
comes and  probabilities,  we  insured  that  Dr.  Schroeder's  choices  were 
consistent.  The  final  utility,  or  benefit,  matrix,  [u],  is  given  in 
Appendix  1.  The  components  of  this  matrix,  Uj^^,  represent  the  utility  of 
treating  a true  category  j patient  as  a category  i patient. 

It  is  important  to  stress  that  the  values  in  the  utility  matrix  are 
Dr,  Schroeder's  alone.  This  mi^t  be  impropriate  since  he  is  the  decision 
maker  in  this  case.  Each  decision  maker  will  have  his  own  utility  struc- 
ture which  will  be  a function  of  his  experience,  environment,  and  attitudes. 
This  point  has  been  one  of  great  controversy.  Is  it  the  doctor's  or  the 
patient's  utility  structure  that  is  most  appropriate?  Perhaps  the  doctor 
should  use  his  experience  to  help  place  the  patient's  utilities  in  perspec- 
tive? We  made  no  attempt  to  argue  this  question  in  our  paper.  We  used 
Dr.  Schroeder's  utility  matrix  which  covild  then  be  subjected  to  sensitivity 
analysis.  With  this  understanding,  we  can  later  use  the  utility  matrix  in 
conjunction  with  the  probabilities  that  a patient  actually  falls  within  a 
category  to  calculate  the  expected  utility  of  classification. 

SEI.ECTION  OF  THE  KEY  VARIABLES 

Since  we  collected  data  on  more  than  100  signs  or  symptoms,  it  was 


!> 


As  mentioned  previously,  both  probabilistic  assessment  and  value 
assessment  are  required  for  an  expected  utility  model.  By  applying  the 
B.M.D.  Biomedical  Data  Programs  developed  at  the  Health  Sciences  Computing 
Facility  at  U.C.L.A.,  we  could  estimate  posterior  probabilities  for  each 
category.  In  particular,  B.'>ID05M  (Discriminant  Analysis  for  Several  Groups) 
was  used.  This,  program  coi!5>utes  linear  discriminant  functions  for  classi- 
fication into  one  of  several  groups.^ 

We  can  select  an  m-conqponent  vector  of  symptoms,  Y,  for  classification, 
and  we  assume  that  the  \inder lying  distribution  is  multivariate  normal.  We 
want  to  classify  a random  patient  k with  symptom  vector  Y into  one  of  the 
five  coronary  categories  1,  2,  ...  5.  We  further  assume  that  the  symptoms 
for  each  category  are  distributed  N(p,j,  and  that  = ^ = ...  = e 

Using  the  sample  parameters  and  Sj  as  estimates  of  the  population 
parameters  jij  and  Ej,  BMD05M  performs  the  following  computational  steps: 
Required  transgenerations  on  the  data 
Means  of  symptoms  for  each  category 
S\jra  of  products  of  deviation  from  the  mean 
Pooled  dispersion  matrix  and  its  inverse 
Kahalanobis  distance 

Coefficients  and  constant  termoTj^Q  for  each  category  i 

Using  these  or 's,  the  discriminating  function  for  the  i^^  category  is 
calculated  as: 

n 

^i(Xk)  = 2]ykd»ij  + «io  (2) 

j»i 

where  m is  the  number  of  symptoms,  i is  the  category  number,  and  k is  the 
patient  number. 
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The  discriminant  scores  are  normalized  to  represent  probabilities  for 


each  category  as  follows: 


Pj'ob  {i|Yj^} 


(fi(Tfk)  - Ma*) 


n=l 


- f(Yj^)  Max) 


(3) 


The  typical  output  of  BMDO51-I  includes  these  probsOjilities  as  well  as  a table 
of  classifications  based  upon  largest  probability. 

To  classify  patients  based  upon  a utility  criteria  rather  than  upon 
a largest  probabilityj  we  modified  BMD05M  to  calculate  the  utilities 

of  classifying  each  patient  k into  each  category  i.  Using  the  utility 
matrix,  [u],  and  the  probabilities  from  the  discriminant  analysis,  we  calcu- 
lated the  expected  utility  of  classifying  patient  k into  category  i as: 
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n=l 

Classification  is  then  based  upon  the  hipest  eicpectei  utility.  We  then 
calculated  the  average  expected  utility  for  all  categories  i,  (the  entire 
100  patient  data  base)  as: 

100  5 100 

E(average)  = ( ZZ  Prob{n|^}  Uni)/100  = J|^Ei(Yj^)/lOO  (5) 

k=l  n=l  k=l 

The  procedure  is  schematically  portrayed  in  Figure  3. 


9 


The  problem  now  remained  to  determine  which  symptoms  provided  the  most 
useful  discrimination.  We  examined  many  combinations  of  symptoms  and  calcu- 
lated the  average  eaqpected  utility  using  each  symptom  set.  We  found  that 
ridi  factors  and  prodromal  data  generally  were  conparable  for  all  categories 
and  added  little  to  the  ability  to  discriminate.  The  most  discriminating 
symptoms  were  found  in  the  following  information: 

Unstable  angina  descriptions 
Admission  chest  pain  descriptions 
Electrocardiograph  (EKO) 

ST.  segment  and  shift 
Throve  and  old  Q-wave 

Enzyme  shift  for  serum  glutamic  oxaloacetic  transaminase  (SCOT) 
Enzyme  shift  for  creatine  phosphokinase  (CHC) 

While  the  other  symptoms  certainly  provided  some  information,  they  provided 
no  marginal  assistance  in  our  ability  to  distinguish  between  categories. 

These  observations  established  a l4  conponent  symptom  vector.  We 
co\ild  now  use  this  vector  as  a basis  for  classifying  new  patients.  It  Is 
inportant  to  note  that  classifying  category  U patients  is  a sinple  task. 
EKG,  laistable  angina,  and  serum  enzymes  are  sufficient  for  an  extremely 
reliable  diagnosis.  Category  4 patients  are  rarely  misdiagnosed.  As  a 
result,  we  were  more  interested  in  a scheme  of  classifying  the  other  cate- 
gories of  patients. 

C0B3TRUCT10W  OF  THE  MODEL  BASED  OH  KEY  VARIABLES 

Using  the  vector  of  key  symptoms  and  the  data  base  Information,  we 
obtained  the  most  useful  set  of  a’s  for  each  coronary  category.  We  then 
developed  a computer  algorithm  to  evaluate  new  patients.  The  vectors 
provided  the  discriminant  scores  for  each  patient  q for  each  category  i. 


We  used  this  in  turn  to  calculate  the  prdbahilitles  and  expected  utility 


for  each  category  as  follows 


The  model  was  now  con^lete  for  diagnosing  any  future  cardiac  patients 


VALIDATIPN  OF  THE  I40DEL 


Using  the  discriminant  coefficients  (r)  obtained  for  the  most 
discriminating  symptoms , we  now  had  to  determine  if  our  approach  would  moA 
on  additional  patients.  By  sG^lylng  the  model  to  the  23  patients  removed 
from  the  original  data  base,  we  calculated  the  discriminant  scores » 
probabilities y and  eigpected  classification  utilities  of  classifying  each 
patient  into  each  category.  Diagnosis  was  based  upon  hipest  expected 
utility.  Twenty-three  patients  are  far  too  few  observations  to  make  firm 


conclusions,  but  they  allowed  us  to  draw  informal  Inferences  about  the 


validity  of  the  model 


If  all  patients  had  been  correctly  categorized,  we  would  have 
achieved  an  eiqsected  average  classification  utility  of  85.  We  actually 
achieved  value  of  77.6.  Wb  found  this  to  be  most  encouraging.  Of  the 
23  patients,  we  found  two  instances  of  what  can  be  called  serious 
ndsclassificatlons.  These  represent  patients  that  were  classified  into 


categories  less  severe  in  nattire  tihan  their  trw  category.  We  compared 


these  results  with  classification  hased  upon  largest  probability,  and  found 
five  serious  mlsclasslflcations  using  the  latter  approach,  ffe  also  found 
several  misdiagnoses  in  which  patients  were  placed  into  a category  more 
severe  than  their  true  category,  but  these  are  far  more  toleraible.  They 
represent  instances  in  which  the  patient  receives  extra  care. 

Although  23  patients  provided  insufficient  information  to  place  com- 
plete confidence  in  the  model,  the  results  we  achieved  were  far  too  success- 
ful to  invalidate  it.  It  is  clear  that  we  must  now  obtain  additional  data 
before  we  can  msike  suitable  statistical  Inferences  about  our  results. 

There  are  other  modifications  to  the  model  that  can  also  be  examined 
in  the  future.  We  can  perform  sensitivity  on  the  utility  matrix  and 
e3Q>lore  the  question  of  whose  preference  structure  should  be  used.  This 
model  dealt  with  non-invasive  symptom  input,  but  we  might  lo<dc  at  other 
procedures  such  as  angiography.  Although  an  angiogram  is  consldeied  to  be 
a surgical  procedure,  the  benefits  gained  might  exceed  the  difficiilties 
associated  with  it.  Additionally,  we  could  extend  the  general  model  to 
other  areas  of  medical  practice. 

OOWCLUSIOWS 

The  e^qpected  utility  classification  model,  using  discriminant  analy- 
sis and  subjective  value  assessment,  is  a worthwhile  tool  for  coronary  care 
patient  classification.  It  provides  the  decision  maker  with  a rapid, 
inexpensive  method  of  classifying  patients  without  invasive  procedures. 

The  model  is  a recursive  one  that  is  continuously  \q>dated  and  refined  as 
new  patient  data  becomes  available.  The  approach  is  flexible  and  is 
designed  to  fit  the  decision  maker's  expertise  and  the  appropriate  pre- 
ference structure. 
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Initial  effbrts  in  applying  tlie  model  to  actual  diagnoaia  hanre  provan 


aueceaaful  and  provide  the  impetua  to  ei^tand  the  data  baae.  Ihia  dx>uld 
make  the  model  more  reliaible  and  allov  doctors  to  tiae  it  vith  confidence. 


This  in  turn,  will  provide  the  patient  with  higher  quality  medical  care. 


APPEHDIX  2 


Table  of  SytQptoms  (After  I^runlng) 

Age 

Sex 

lfunft>er  of  montha  known  heart  dlaeaae 
Ride  factors 

hypertension 
obesity 
hyperlipids 
diabetes  mellitus 
smoking 

family  history  of  heart  disease 
Prodromal  symptoms 
generalized 
cardiovascular 
gastrointestinal 
personality 

Unstable  angina  description 
Admission  chest  pain  description 
EKG 

ST.  segment  and  shift 
T wave  and  Q wave 
Serum  enzyme  shifts 
SCOT 
CPK 
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The  purpose  of  this  project  is  to  categorise  cardiac  patients  using  an  expected 
utility  criteria.  Bayesian  updating  using  synptonatle  Infoznatlon  has  been  a 
successAil  tool  for  disease  diagnosis.  However,  In  classification  of  cardiac 
patients,  dlscrlnlnaat  analysis  using  the  BIDO^I  conputer  package  achieves 
coBpardbls  results  with  far  less  effort.  Dlscrlninaiit  scores  can  be  evaluated 
and  nornallsed  as  prcMblllstlc  estlnates  of  the  patient's  cardiac  condition. 

By  Incorporating  these  prohShlUtles  with  subjective  value  estlnates  for  pos- 
sible nls-classlfleatlena,  the  expected  utility  of  assigning  patients  (eon't) 
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to  each  cardiac  category  can  be  calculated.  Fatieute  are  then  claaclfled 
Into  eardlae  eategorlec  based  uponUghest  expected  utility. 
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